Frederic Dambreville! 


'Delegation Generale pour l Armement, DGA/CTA/DT/GIP/PRO 
16 Bis, Avenue Prieur de la C* ote d’Or 
94114, Arcueil Cedex France 


Probabilized logics related to 
DSmT and Bayes inference 


Published in: 

Florentin Smarandache & Jean Dezert (Editors) 

Advances and Applications of DSmT for Information Fusion 
(Collected works), Vol. I 

American Research Press, Rehoboth, 2004 

ISBN: 1-931233-82-9 

Chapter VIII, pp. 155 - 191 


Abstract: This work proposes a logical interpretation of the non hybrid Dezert 
Smarandache Theory (DSmT). As probability is deeply related to a classical seman- 
tic, it appears that DSmT relies on an alternative semantic of decision. This se- 
mantic is characterized as a probabilized multi-modal logic. It is noteworthy that 
this interpretation justifies clearly some hypotheses usually made about the fusion 
rule (ie. the independence between the sensors). At last, a conclusion arises: there 
could be many possible fusion rules, depending on the chosen semantic of decision; 
and the choice of a semantic depends on how the actual problem is managed. Illus- 
trating this fact, a logical interpretation of the Bayesian inference is proposed as a 


conclusion to this chapter. 


8.1 Introduction 


\ X Y hen a non deterministic problem appears to be too badly shaped, it becomes difficult to make a 

coherent use of the probabilistic models. A particular difficulty, often neglected, comes from the 
interpretation of the raw data. The raw data could have a good probabilistic modelling, but in general 
such informations are useless: an interpretation is necessary. Determining the model of interpretation, 


and its probabilistic law, is the true issue. Due to the forgotten/unknown case syndrome, it is possible 
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that such model cannot be entirely constructed. In some cases, only a rather weak approximation of the 
model is possible. Such approximated model of interpretation may produce paradoxical results. This is 


particularly true in information fusion problems. 


Several new theories have been proposed for managing these difficulties. Dempster Shafer Theory 
of evidence [I] [5] is one of them. In this paper, we are interested in the Dezert Smarandache Theory 
(DSmT) [3], a closely related theory. These theories, and particularly the DSmT, are able to manipulate 
the model contradictions. But a difficulty remains: it seems uneasy to link these various theories. In 
particular, their relation with the theory of probability seems unclear. Such a relation is perhaps not 
possible, as could claim some authors, but it is necessary: it is sometimes needed to combine methods 
and algorithms based on different theories. This paper intends to establish such relations. A probabilized 
multi-modal logic is constructed. This probabilized logic, intended for the information fusion, induces the 
same conjunctive fusion operator as DSmT (ie. operator @). By the way, the necessity of independent 
sources for applying the operator © is clarified and confirmed. Moreover, this logical interpretation in- 
duces a possible semantic of the DSmT, and somehow enlightens the intuitions behind this theory. Near 
the end, the paper keeps going by giving a similar interpretation of the Bayes inference. Although the 
Bayes inference is not related to the DSmT, this last result suggests that probabilized logics could be a 


possible common frame for several non deterministic theories. 


Section [8.2] is beginning by a general discussion about probability. It is shown that probabilistic 
modellings are sometimes questionable. Following this preliminary discussion, two versions of the theory 
of evidence are introduced: the historical Dempster Shafer Theory and the Transferable Belief Model 
of Smets [8]. Section [8.3] makes a concise presentation of the Dezert Smarandache Theory. The short 
section [8-4] establishes some definitions about probability (and partial probability) over a set of logical 
propositions. These general definitions are needed in the following sections. Section [8-5] gives a logical 
interpretation of the DSmT on a small example. This section does not enter the theory too deeply: 
the modal logic associated to this interpretation is described with practical words, not with formulae! 
Section 8.6] generalizes the results to any cases. This section is much more theoretic. The modal logic is 
defined mathematically. Section [8-7] proposes a similar logical interpretation of the Bayesian inference. 


Section [8-3] concludes. 
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8.2 Belief Theory Models 


8.2.1 Preliminary: about probability 


This subsection argues about the difficulty to modelize “everything” with probability. Given a measurable 
universe of abstract events (or propositions) Q = {w;,i € I}, a probability P could be defined as a bounded 


and normalized measure over 2. In this paper, we are interested in finite models (J is finite). 


A probability P could also be defined from the probabilities p(w) of the elementary events w € 2. The 


density of probability p should verify (finite case) : 
p: 24 Rt, 


and: 


S> pw) =1. 


wEQ 


The probability P is recovered by means of the additivity property: 
VA CQ, P(A) = S> p(w). 
weEA 

It is important to remember how such abstract definitions are related to a concrete notion of “chance” 
in the actual universe. Behind the formalism, behind the abstract events, there are actual events. The 
formalism introduced by the abstract universe is just a modelling of the actual universe. Such a 
model is expected to be more suitable to mathematical manipulations and reasoning. But there is no 
reason that these actual events are compatible with the abstract events. Probability theory assumes 
this compatibility. More precisely, probability assumes that either the abstract and actual events are 
the same, either there is a mapping from the actual events to the abstract events (figure[8.1). When this 
mapping hypothesis is made, the density function makes sense then, in regard to the observation. Indeed, 


a practical construction of p becomes possible with a frequentist taste: 
1. Set p(w) = 0 for allweQ, 
2. Make N tossing of an actual event. For each tossed event, a, do: 
(a) Select the w € Q such that a maps to w, 
(b) Set pw) = pw) +1, 
3. Set p(w) ~ x, pw) for allw Ee. 


The next paragraph explains why the mapping from the actual events to the abstract events is not always 


possible and how to overcome this difficulty. 
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Actual universe (observations) Abstract universe (representation) 
x x x ° x x x e 
e x e e e x e e 
+— 
e e e e e e e e 
e e e e e e e e 


An abstract event is a connected component; in this example, 


the x-ed observations map to the unique x-ed component 


Figure 8.1: Event mapping: probabilist case 


8.2.1.1 The impossible quest of the perfect universe 


It is always possible to assume that there is a perfect universe, where all problems could be modeled, but 
we are not able to construct it or to manipulate it practically. However, we are able to think with it. Let 


A be the actual universe, let 2 be the abstract universe, and let Z be this perfect universe. 


The structure of 2 is well known; it describes our modelling of the actual world. This is how we interpret 
the observations. Practically, such interpretation is almost always necessary, while the raw observation 
may be useless. But 2 is only an hypothesis: our knowledge about the observation is generally insufficient 


for a true interpretation. 


The universe A is observed, but like Z its structure is not really known: although an observation is 
possible, it is not necessary possible to know the meaning, the true interpretation, of this observation. 


For example, what is the meaning of an observation for a situation never seen before? 


The universe Z is perfect, which means that it contains the two other, and is unknown. The word contains 
has a logical signification here, ie. the events/propositions of A or 2 are macro-events/macro-propositions 
of Z (figure [8:3) : 

ACP(Z) and QCP(Z), 


with the following exhaustiveness (x) and coherence (c) hypotheses for A and 2: 


x. 2=)a=) w, 


acA we, 


cl. [aj,a2 € A, a, Fag] SaiNag =9, 


c2. [wi ,w2 EO, wy Fug] Swi Nw. =O. 
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Actual universe Perfect universe Abstract universe 
e e 2 e e e e e e e e e e 
1 3 a b 
e e o e e e e e e e e e 
— K— 

e e e e e e e e e . e e e 

5 6 e 
e e e e e e e e eo! e e e 


An abstract event (ie. a,b,c, d,e) is a — connected component 


An actual event (te. 1,2,3,4,5,6) is a = connected component 
Figure 8.2: Event mapping: general case 


The exhaustiveness and coherence hypotheses are questionable; it will be seen that these hypotheses 


induce contradictions when fusing informations. 


Of course, the abstract universe Q is a coherent interpretation of the observations, when any actual 
event a € A is a subevent of an abstract event w € 2. But since the interpretation of A is necessarily 
partial and subjective, this property does not hold in general. The figure[B2]gives an example of erroneous 
interpretation of the observations: the actual event 5 intersects both the abstract event d and the abstract 
event c. More precisely, if an actual event a € A happens, there is a perfect event z € a which has 
happened. Since Z contains (ie. maps to) ©, there is an unique abstract event, w € , which checks z, 
ie. z € w. As a conclusion, when a given actual event a happens, any abstract event w € 2 such that 
wa #Q is likely to happen. Practically, such situation is easy to decide, since it just happens when a 
doubt appears in a measure classification. The table [8.1] refering to the example of figure[8.2] gives the 


possible abstract events related to each tossed observation. 


Finally, it does not seem possible to define a density of probability for unique abstract events from 
partially decidable observations. But it is possible to define a density function for multiple events. 


Again, a construction of such function, still denoted p, is possible in a frequentist manner: 
1. Set p(¢) = 0 forall dc Q, 
2. Make N tossing of an actual event. For each tossed event, a, do: 
(a) Define the set ¢(a) = {wEQ/wnaFO}, 
(b) Set p(o(a)) = p(¢(a)) +1, 


3. Set p(¢) ~ x p(¢) for all @cQ. 
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Tossed observation Possible abstract events 
e 
e e e e 
1 a ; e . e 
e e e 
e 
e e e 
e e a 
2 
e 
e 
e b 
3 e e 
e 
e 
e 
e 
e 
b 
e e 
et 
e 
e 
e 
e 
5 e . e , of 
e 
e 
e 
b e 
e e e e e 
6 > e . e > e€ 
e e e e e 


Table 8.1: Event multi-mapping for figure [8.2] 


In particular, p(0) = 0. 


In the particular case of table [8.1], this density is related to the probability of observation by: 


pita, ch = p(1) ) p{a} = p(2) ’ p{d} = p(3) + p(4) ) pte, d} = p(5) , pio, C, e} = p(6) . 
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The previous discussion has shown that the definition of a density of probability for the abstract events 
does not make sense, when the interpretations of the observations are approximative. However, it is 
possible to construct a density for multiple abstract events. Such a density looks quite similarly to the 


Basic Belief Assignment of DST, defined in the next section. 


8.2.2. Dempster Shafer Theory 
8.2.2.1 Definition 


A Dempster Shafer model [1] [2] [5] is characterized by a pair (Q,m), where © is a set of abstract events 
and the basic belief assignment (bba) m is a non negatively valued function defined over P(Q) , the set 


of subsets of 2, such that: 
m(0)=0 and x m(¢) =1. 


eca 


A DSm (Q,m) could be seen as a non deterministic interpretation of the actuality. Typically, it is a tool 


providing informations from a sensor. 


8.2.2.2 Belief of a proposition 


Let ¢ C Q be a proposition. Assume a basic belief assignment m. The degree of belief of ¢, Bel(¢) , and 
the plausibility of ¢, Pl(¢), are defined by: 


Bel(¢) = S) m() and PL¢)= J) mw). 


vce WNPAD 
Bel and PI do not satisfy the additivity property of probability. Bel(¢) and Pl(¢) are the lower and upper 
measures of the “credibility” of the proposition @. These measures are sometimes considered as the lower 


bound and the upper bound of the probability of ¢: 


Bel(¢) < P(¢) < PI(6) . 
This interpretation is dangerous, since it is generally admitted that probability and DST are quite different 
theories. 
8.2.2.3 Fusion rule 


Assume two bba mj, and mz, defined on the same universe (2, obtained from two different sources. It is 


generally admitted that the sources are independent. Then, the bba m; © mz is defined by: 


my © mo2(0) =0 ? 


mi ® m2(¢) = : S>omii)ma(}2), where Z=1— S> mai(yi)mo(2) . 
PpiNnpe=P pinye=0 


The operator © describes the (conjunctive) information fusion between two bba. 
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The normalizer Z is needed since the bba is zeroed for the empty set §. Except some specific cases, it is 


indeed possible that: 


ma (1 )me(2) > 0, (8.1) 


Wy. =O. (8.2) 


In particular, the property (8.2) is related to an implied coherence hypothesis; more precisely, since the 


universe () is defined as a set of events, the intersection of distinct singletons is empty: 


V{wr}, {wo} CO, {ar} A {wo} > {wi} N {or} =0. 


Notice that this hypothesis is quite similar to the hypothesis c2. of section[8.2.1). The coherence hypothesis 
seems to be the source of the contradictions in the abstract model, when fusing informations. Finally, 
Z <1 means that our abstract universe 2 has been incorrectly defined and is thus unable to fit the both 
sensors. Z measures the error in our model of interpretation. This ability of the rule @ is really new in 


comparison with probabilistic rules. 


8.2.3. Transferable Belief Model 


Smets has made an extensive explanation of TBM [8]. This section focuses on a minimal and somewhat 


simplified description of the model. 


8.2.3.1 Definition 


A Transferable Belief Model is characterized by a pair (Q,m), where ( is a set of abstract events and the 


basic belief assignment m is a non negatively valued function defined over P(Q) such that: 


$= m(¢) = 1. 


ocQ 


In this definition, the hypothesis m(0) = 0 does not hold anymore. 


8.2.3.2 Fusion rule 


Smets’ rule looks like a refinement of Dempster and Shafer’s rule: 
m1 ®ma(g)= D> mai(hr)me(2) . 
pinve=¢ 
Notice that the normalizer does not exist anymore. The measure of contradiction has been moved into 


m(@). This theory has been justified from an axiomatization of the fusion rule. 
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8.2.3.3 TBM generalizes DST 


First, notice that any bba for DST is a valid bba for TBM, but the converse is false because of @. Now, 
for any bba mr of TBM such that mr(@) <1, construct the bba A(mr) of DST defined by: 


mr() 


A(mr)() =0 and ¥6 CQ: G40, Amr) = | | ns 


A is an onto mapping. Any bba mp of DST is a bba of TBM, and A(mp) = mp. 


A is a morphism for @. JE. A(mr1 ®mrz2) = A(m71) 6 A(mrz). 
Proof. By definition, it is clear that: 
A(mr1) 6 A(mr2)(0) = 0 = A(mr1 O mr2)(9) . 


Now, for any ¢ CQ, such that 640: 


Yo Almr1) 1) A(mr.2)(H2) 


A(m A) & A(m 2)(@) _— biMpa= 
: : YY Alri) 1) A(mr.2) (2) 


o40 WiNw~e=¢ 


mr,1 (1) mr,2(2) 


1—mr,(0) Org = mr2(0) dD mra(Y1)mr,2(v2) 


hen piNyre= wi Nwe2=¢ 
= mr(1) malo) .3 S- mrai(wdi)mr.2(v2) 
», wee! 1— mr) “1- mr20) — o40 virde=o 
mri ® mr,2(9) mri &mr7,2(¢) 
a ? ? = ? ? _ A . 
S- mri ®mr2(¢) 1l—mri® mr2(0) (mr1 ® m7,2)(¢) 
PAO 


Since A is an onto morphism, TBM is a generalization of DST. More precisely, a bba of TBM contains 
more information than a bba of DST, ie. the measure of contradiction m(@), but this complementary 
information remains compatible with the fusion rule of DST. 

The Dezert Smarandache Theory is introduced in the next section. This theory shares many common 
points with TBM. But there is a main and fundamental contribution of this theory. It does not make 
the coherence hypothesis anymore and the contradictions are managed differently: the abstract model 
is more flexible to the interpretation and it is not needed to rebuild the model in case of contradicting 


sensors. 


8.3 Dezert Smarandache Theory (DSmT) 


Both in DST and in TBM, the difficulty in the model definition appears when dealing with the con- 
tradictions between the informations. But contradictions are unavoidable, when dealing with imprecise 


informations. This assertion is illustrated by the following example. 
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B&W example. Assume a sensor s; which tells us if an object is white (W) or not (NW), and gives 
no answer (NAj) in litigious cases. The actual universe for this sensor is A; = {W, NW,NAj}. Assume 
a sensor S2 which tells us if an object is black (B) or not (NB), and gives no answer (NAg2) in litigious 
cases. The actual universe for this sensor is Aj = {B, NB, NAg}. These characteristics are not known, 
but the sensors have been tested with black or white objects. For this reason, it is natural to model 
our world by 2 = {black, white}. When a litigious case happens, its interpretation will just be the pair 


{black, white}. Otherwise the good answer is expected. The following properties are then verified: 
B,NW c black and W,NBC white. 


The coherence hypothesis is assumed, that is black MN white =. The event black MN white is impossible. 
This model works well, as long as the sensors work separately or the objects are still black or white. Now, 
in a true universe there are many objects which are neither white and neither black, and this without 
any litigation. For example: gray objects. Assume that the two sensors are activated. Then, the fused 
sensors will answer NWMNB, which will be interpreted by blackN white. This contradicts the coherence 


hypothesis. 


Conclusion. This example is a sketch of what generally happens, when constructing a system of de- 
cision. Several sources of information are available (two sensors here). These sources have different 
discrimination abilities. In fact, these discrimination abilities are not really known, but by running these 
sources on several test samples (black and white objects here), a model of theses abilities is obtained 
(here it is learned within 2 that our sensors distinguish between black and white objects). Of course, it 
is never sure that this model is complete. It is still possible actually that some new unknown cases could 
be discriminated by the information sources. In the example, the combination of two sensors made it 
possible to discriminate a new class of objects: the neither black, neither white objects. But when fusing 
these sensors, the new cases will become contradictions regarding the coherence hypothesis. Not only the 
coherence hypothesis makes our model contradictory, but it also prevents us from discovering new cases. 
The coherence hypothesis should be removed! Dezert and Smarandache proposed a model without the 


coherence hypothesis. 


8.3.1 Dezert Smarandache model 


In DST and TBM, the coherence hypothesis was implied by the use of a set, 9, to represent the ab- 
stract universe. Moreover, the set operators 1, U and c (te. set complement) were used to explain the 
interactions between the propositions ¢ C 2. In fact, the notion of propositions is related to the notion 
of Boolean Algebra. Sets together with set operators are particular models of Boolean Algebra. Since 
DSmT does not make the coherence hypothesis, DSmT cannot rely on the set formalism. However, some 


boolean relations are needed to explain the relations between propositions. Another fundamental Boolean 
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Algebra is the propositional logic. This model should be used for the representation of the propositions 
of DSmT. Nevertheless, the negation operator will be removed from our logic, since it implies itself some 
coherence hypotheses, eg. 6 \ ad = L! By identifying the equivalent propositions of the resulting logic, 
an hyper-power set of propositions is obtained. Hyper-power sets are used as models of universe for the 
DSmT. 


8.3.1.1 Hyper-power set 


Let © = {¢;/i € I} be a set of propositions. The hyper-power set < © > is the free boolean pre-algebra 


generated by ® and the boolean operators A and V: 
®,<@>A<@>,<O>V<O>cC< 65> 

and A, V verify the properties: 

Commutative. 6\wW=WAgdanddVv=VvVo, 

Associative. dA (WAn) = (dA) Anand dV (WV) =(éVY)V4, 

Distributive. 6A (WV) = (@AY)V (An) and Vv (WAN) = (VY) A(OV 0), 

Idempotent. 6\gd=¢and dV ¢=¢, 

Neutral sup/sub-elements. 6\(éVuv)=¢dand dV (dAv)=¢, 
for any ¢,v¥,n E< ®>. 
Unless more specifications about the free pre-algebra are made, this definition forbids the propositions to 
be exclusive (no coherence assumption) or to be exhaustive. In particular, the negation operator, =, and 


the never happen/ always happen, /T, are excluded from the formalism. Indeed, the negation is related 


to the coherence hypothesis, since T is related to the exhaustiveness hypothesis. 


Property. It is easily proved from the definition that: 


VO, WE< >, dAV=zG0 —] OVYVY=Y. 
The order < is a meta-operator defined over < ® > by: 
PSP = ONV=O —S OVY=Y. 
The order < is a meta-operator defined over < ® > by: 


o<v = [¢<vand dF]. 


The hyper-power set order < is the analogue of the set order C . 
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8.3.1.2 Dezert Smarandache Model 


A Dezert Smarandache model (DSmm) is a pair (®,m), where the (abstract) universe ® is a set of 
propositions and the basic belief assignment m is a non negatively valued function defined over < ® > 


such that: 


8.3.1.3 Belief Function 


The belief function Bel is defined by: 


VoE< >, Bel()= So my). (8.3) 
WE<B>:~<d 


Since propositions are never exclusive within < ® >, the (classical) plausibility function is just equal to 


1. The equation (8:3) is invertible: 


Vb E< & >, m(d) =Bel(@)— SY) my). 


pE<O>v<¢ 
8.3.2 Fusion rule 


For a given universe ® , and two basic belief assignments m and mz, associated to different sensors, the 


fused basic belief assignment is m1 @ mz, defined by: 


m1 © m2(9)= D> ma(r)ma(2) . (8.4) 
piAp2=6 


8.3.2.1 Dezert & Smarandache’s example 


Assume a thief (45 years old) witnessed by a granddad and a grandson. The witnesses answer the 
question: is the thief young or old? The universe is then ® = {young, old}. The granddad answers that 


the thief is rather young. Its testimony is described by the bba: 
m,(young) = 0.9 and my(young V old) = 0.1 (slight unknown) . 
Of course, the grandson thinks he is rather old: 
mg(old) = 0.9 and me(young V old) = 0.1 (slight unknown) . 


How to interpret the testimonies? The fusion rule says: 


m, © m2(young A old) = 0.9801 (highly contradicts — third case) 


m, © m2(young) = m1 ®@ me(old) = 0.0099 


m, © m2(young V old) = 0.0001 


Our hypotheses contradict. There were a third case: the thief is middle aged. 
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8.3.2.2 Comments 


In DSmT, there is not a clear distinction between the notion of conjunction, A, the notion of third case 
and the notion of contradiction. The model does not decide for that and leaves this distinction to our 
last interpretation. It is our interpretation of the model which will make the distinction. Thus, the 
DSm model avoids any over-abstraction of the actual universe. Consequently, it never fails although 
we could fail in the last instance by interpreting it. Another good consequence is that DSmT specifies 
any contradiction/third case: the contradiction ¢ A ~ is not just a contradiction, it is the contradiction 


between ¢ and w. 


8.4 Probability over logical propositions 


Probabilities are classically defined over measurable sets. However, this is only a manner to modelize the 
notion of probability, which is essentially a measure of the belief of logical propositions. Probability could 
be defined without reference to the measure theory, at least when the number of propositions is finite. 
In this section, the notion of probability is explained within a strict logical formalism. This formalism is 


of constant use in the sequel. 


Intuitively, a probability over a set of logical propositions is a measure of belief which is additive (disjoint 
propositions are adding their chances) and increasing with the proposition (weak propositions are more 
probable). This measure should be zeroed for the impossible propositions and full for the ever-true 
propositions. Moreover, a probability is a multiplicative measure for independent propositions. ‘The 
independence of propositions is a meta-relation between propositions, which generally depends on the 


problem setting. 


These intuitions are formalized now. It is assumed that the reader is used with some logical notions. 


8.4.1 Definition 


Let L be at least an extension of the classical logic of propositions, that is L contains the operators A, V , 
— (and, or, negation) and the propositions | , T (always false, always true). Assume moreover that some 
propositions pairs of L are recognized as independent propositions (this is a meta-relation not necessarily 
related to the logic itself). A probability p over L is a IR* valued function such that for any proposition 
@ and w of L: 


Additivity. p(¢\ wv) + p(oV wv) = p(d) + v(v), 


Coherence. p(L) =0, 


Finiteness. p(T) =1, 
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Multiplicativity. When ¢ and w are independent propositions, then p(¢ A W) = p(¢)p(w) . 


8.4.2 Property 


The coherence and additivity implies the increaseness of p: 
Increaseness. p(¢ A) < p(). 
Proof. Since ¢ = (6A w) V (6A mw) and (6A WV) A (dA 7) = L, it follows from the additivity: 
P(d) + p(L) = p(PAY) + (PA 7H) . 


From the coherence p(_L) = 0, it is deduced p(¢) = p(@AW) + p(@A mW). Since p is non negatively 
valued, p(¢) => p(@ AY). 


8.4.3 Partially defined probability 


In the sequel, knowledges are alternately described by partially known probabilities over a logical system. 


Typically, the probability p will be known only for a subset of propositions @ Cc L. 


Partial probabilities have been investigated by other works [9], for the representation of partial knowl- 
edge. In these works, the probabilities are characterized by constraints. It is believed that this area 
has been insufficiently investigated. And although our presentation is essentially focused on the logical 
aspect of the knowledge representation, it should be noticed that it is quite related to this notion of 
partial probability. In particular, the knowledge of the probability for a subset of propositions implies 
the definition of constraints for the probability over the whole logical system. For example, the knowledge 


of 7 = p(@ Aw) implies a lower bound for p(¢) and p(w): p(d) > a and p(w) > 7. 


The next section introduces, on a small example, a new interpretation of DSmT by means of proba- 
bilized logic. 
8.5 Logical interpretation of DSmT: an example 


A bipropositional DSm model A = ({¢1, ¢2},m) is considered. This section proposes an interpretation 


of this DSm model by means of probabilized modal propositions. 


8.5.1 <A possible modal interpretation 


Consider the following modal propositions: 
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U. Unable to decide between the ¢;’s, 
a;. Proposition ¢; is sure ; No Other Information (NOD) , 
I. Contradiction between the ¢,’s. 
It is noticeable that these propositions are exclusive: 
Va,b € {U,a1,02,I}, aFbSaANb=L. (8.5) 


These propositions are clearly related to the propositions ¢; : 


IT< dAdo, b1, 62, 1 V oe [the contradiction I implies everything] 
ai< oi, 61.V¢2, fori=1,2 [a; implies ¢; and ¢1 V ¢9] (8.6) 
U<V ¢2 [U only implies $1 V $2] 


These propositions are also exhaustive; ie. in the universe ® , either one of the propositions J, a1, a2,U 
should be verified: 
IVarVagVU= @1V 2. (8.7) 


Since the propositions a;,U,I are characterizing the knowledge about ¢; (with NOT), the doubt or the 
contradiction, it seems natural to associate to these propositions a belief equivalent to m(¢;) , m(¢1 V ¢2) 


and m(¢, A ¢2). These beliefs will be interpreted as probabilities over J, U and a; : 


pL) = m(¢1 A 2) , p(U) = m(¢1 V ¢2) . p(ai) = m(¢;), fori =1,2. (8.8) 


Such probabilistic interpretation is natural but questionable: it mixes probabilities together with bba. 
Since the propositions ¢; are not directly manipulated, this interpretation is not forbidden however. In 
fact, it will be shown next that this interpretation implies the fusion rule © and this will be a posterior 


justification of such hypothesis. 


8.5.2 Deriving a fusion rule 


In this section, a fusion rule is deduced from the previous probabilized modal interpretation. This rule 
happens to be the (conjunctive) fusion rule of DSmT. 
Let A; = ({¢1, ¢2},m,;) be the DSm models associated to sensors 7 = 1,2 working beside the same 


abstract universe {¢1, 62}. Define the set of modal propositions S; = {J;,aj1,aj2,U;}: 
U;. Unable to decide between the ¢;’s, according to sensor 7 , 
aj; Proposition @; is sure and NOI, according to sensor 7 , 


I; . Contradiction between the ¢;’s, according to sensor j. 
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The propositions of $; verify of course the properties (8.5), (8.6) , (8.7) and (8.8), the subscript ; being 
added when needed. Define: 


S=S, A Sp = {a1 A ae/ay € S41 and ag € So}. 


Consider a = a; A ag and b = b; A bz, two distinct elements of S. Then, either a, 4 6; or ag  b2. Since 


S; verifies (8.5) , it follows a; A by = L or az A bg = L, thus yielding: 
(a4 A az) mx (by A bg) = (ay x bi) mx (a2 A bz) ae 


S is made of exclusive elements. It is also known from (8&7) that $1 V ¢2 = Vases; a; ; 5; is exhaustive. 


It follows: 
b1 V b2 = (1 V $2) A (G1 V $2) = \ VV a= Va. 


j=1 a;€S; aes 


S is exhaustive. In fact, S enumerates all the possible cases of observation. It is thus reasonable to think 
that the fused knowledge of these sensors could be constructed from S. The question then arising is: 
what is the signification of a proposition a, \ az € S? It is remembered that a proposition of 5; just tells 
what is known for sure according to sensor 7. But the semantic for combining sure or unsure propositions 


is quite natural 


e unsure + unsure = unsure 


e unsure + sure = sure 
e sure + sure = sure OR contradiction 
e anything + contradiction = contradiction 


In particular contradiction arises, when two informations are sure and these informations are known 


contradictory. This conduces to a general interpretation of S: 


A Ip Q21 a22 Us 


J, Contradiction Contradiction Contradiction Contradiction 


Qi, Contradiction ¢, is sure Contradiction ¢, is sure 
Q\2 Contradiction Contradiction o2 is sure o2 is sure 
U, Contradiction og, is sure 2 is sure Unsure 


At last, any proposition of S' is a sub-event of a proposition J, a 1,a@ 2 or U, defined by: 
U. The sensors are unable to decide between the ¢;’s, 
a;,. The sensors are sure of the proposition ¢; , but do not know anything else , 


I. The sensors contradict . 


lIn fact, the independence of the sensors is implicitly hypothesized in such combining rule (refer to next section). 
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Since S' is exhaustive, the propositions U, a;, J are entirely determined by S: 


elJ= (ih A Tp) V (hh A Q21) V (hh mx Q22) V (hh A U2) V (a1 \ In)V 
(a12 \ Iz) V (U1 A Ip) V (ay2 /\ Q21) V (au \ Q22) ‘ 


eq= (a1; A aa) V (U1 A ai) V (a1; A U2), 
e U=U,AU2. 


The propositions I,a;,U are thus entirely specified and since S' is made of exclusive elements, their 


probabilities are given by: 
e pL) = pi A 12) + ph A a1) + pi A a2) + ph A U2) +--+ + plan A a22), 
© p(a;) = p(ar A aa) + p(U1 A a2:) + p(ari A U2), 
e p(U) = p(U; A U2). 


At this point, the independence of the sensors is needed. The hypothesis implies p(a; A az) = p(a1)p(az). 


The constraints (8:8) for each sensor j then yield: 
© pL) = mi(¢1 A G2)ma(g1 A G2) + mi(1 A b2)ma(P1) + +++ + m1(b1)mM2(d2) , 
© p(ai) = m1(b;)mMa(di) + mi(o1 V b2)ma(Gi) + M1 (Gi)M2(b1 V 2), 
e p(U) = mi(¢1 V b2)me(¢1 V ¢2) - 
The definition of m; © mg implies finally: 
pl) =m1 @m2(o1\¢2), — plas) = m1 @ma2(gi), and —— p(U) = m1 © ma(P1 V $2). 


Our interpretation of DSmT by means of probabilized modal propositions has implied the fusion rule @. 


This result is investigated rigorously and generally in the next section. 


8.6 Multi-modal logic and information fusion 


This section generalizes the results of the previous section. The presentation is more formalized. In 
particular, a multi-modal logic for the information fusion is constructed. This presentation is not fully 


detailed and it is assumed that the reader is acquainted with some logical notions. 


8.6.1 Modal logic 


In this introductory section, we are just interested in modal logic, and particularly in the T-system. There 


is no need to argue about a better system, since we are only interested in manipulating the modalities 
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aQ, © and 70. 


o] 


Being given ® a set of atomic propositions, the set of classical propositions, C(®) more simply denoted 


C, is defined by: 
e@cC,leCandTec, 
elf dwecC,then PEC, dAVEC, dVyEeCanddg-YWEC. 
The set of modal propositions, M(®) also denoted M, is constructed as follows: 


eCcmM, 


eIfgeM, then O6€ M andode M, 


elfgweM,then-gE€M,dGAVEM, OVWPEMandgr-YPEeM. 


The proposition O¢ will mean that the proposition @ is true for sure. The proposition o@ will mean that 


the proposition @ is possibly true. 
In the sequel, the notation / ¢ means that ¢ is proved in T. A proposition ¢ such that | ¢ is also called 
an axiom. The notation ¢ = wW means both g-— yw andFyw— @. 


All axioms are defined recursively by assuming some deduction rules and initial axioms. 
Modus Ponens (MP). For any proposition ¢,  € M, such that + ¢ and+ ¢ > W, it is deduced w. 


Classical axioms. For any ¢,~,7 © M, it is assumed the axioms: 


6. d= W=6VY, 
7. dAv = 7(40V -W). 


It is deduced from these axioms that: 


e The relation + ¢ — yw is a pre-order with a minimum | and a maximum T: _ is the strongest 


proposition, T is the weakest proposition, 


e The relation = is an equivalence relation. 


Modal axioms and rule. Let ¢,W¢€ M. 


i. From ¢ is deduced | O¢; axioms are sure. This does not mean + ¢ — O@ which is false! 
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ii. Fk O(¢ > Ww) — (Ad — Ow); when the inference is sure and the premise is sure, the conclusion 


is sure, 


iii. / O¢ — ¢; sure propositions are true, 


iv. o¢ = —0-9; is unsure what cannot be false for sure. 


It is deduced that the proposition O@ is stronger than ¢ which is stronger than o@. 


Notation. In the sequel, 7 < ¢ means + w — ¢, and w < @ means both y < dand d6#w. 


The logical operators are compatible with =. Denote ¢/= = {~ € M/w = ¢}, the class of 
equivalence of ¢. Let d,weM, db € ¢z and w w=. Then holds: 


© do be (G>v)z © 99 € (>9)/= © AVE (AY) = 
© ovebe(dVey= © O¢ € (Od) /= © od€ (o¢)/= 
The logical operators over M are thus extended naturally to the classes of M by setting: 
© be = 2 (¢ W)/= © 7¢/= 4 (7¢) = © b=eNVj= (dA W)/= 
© b/2V j= 5 (6VY)/= © Od/25(Gd)j= oda 5 (0) /= 


From now on, the class ¢/= is simply denoted ¢. 


Hyper-power set. Construct the subset of classical propositions F'(®) recursively by the properties 
® c F(®) andVd,wv € F(®), [dA € F(®) and dVw € F(®)]. The hyper-power of ® , denoted < © >, 


is the set of equivalence classes of F'(®) according to the relation =: 


<@>=F(O)j2={¢/2/9E FO). 


8.6.1.1 Useful theorems 


Let ¢6, WE M. 


1. F (O¢ADy) > OG Ay) and F O(6 Ay) > (O¢A DY) 


2. (od V oy) = of VW) and fk o(@ V w) > (OC V ow) 


3. F (Ad V OY) > O(¢ VY) but ¥ O(¢ Vy) (Gd Vv Oy) 


4. Fo(PAW) = (OA ow) but F (oP A ov) 4 (GAY) 


Proof. Theorem [and theorem PJare dual and thus equivalent (rules[/Jand iv.). It is exactly the same 
thing for theorem B] and theorem J] 
Proof of  (O¢ A Oy) > O(@Ay). 


Classical rules yield the axiom: 
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Fo (b> AY) 

Rule i. implies then: 

F O(¢ > (b > (6A Y))) 

Applying rule ii. twice, it is deduced: 

F O¢ > O( > (GA Y)) 

F O¢ — (Op > O(¢ A y)) 

The proof is concluded by applying the classical rules. 
Proof of f O(¢ A w) > (O¢ A OW). 


Classical rules yield the axioms: 


F (dA) > gandt (6A¥Y) > o 


Rule i. implies then: 

F O(@ Ad) > @) andF O((G AY) > ¥) 
Applying rule ii., it is deduced: 

F O(@ Aw) > O¢ and F O(G Aw) > OY 
The proof is concluded by applying the classical rules. 


Proof of Fk (¢ V Oy) = O(¢V w). 
Classical rules yield the axioms: 


Kb (eV yp) andé p > (dV y) 
Rule i. implies then: 


F O(¢ > (PV #)) and F O( > (¢ Vv #)) 
Applying rule ii., it is deduced: 


F O¢ > O(6 V p) andk Op > O(¢ v p) 
The proof is concluded by applying the classical rules. 


Why ¥ O(¢ Vy) > (O¢v Oy)? 


To answer this question precisely, the Kripke semantic should be introduced. Such discussion is 


outside the scope of this paper. However, some practical considerations will clarify this assertion. 
When ¢V w ts sure, does that mean that ¢ is sure or w is sure? Not really since we know that ¢ or 
w is true, but we do not know which one is true. Moreover, it may happen that ¢ is true sometimes, 


while ~ is true the other times. As a conclusion, we are not sure of ¢ and are not sure of q. 


This example is a counter-example of F O(¢ V w) > (G¢V OW). 
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8.6.2 A multi-modal logic 


Assume that several informations are obtained from different sources. Typically, these informations 


are modalities such as “according to the source 0 , the proposition @ is sure”. Such a modality could 


be naturally denoted O0,¢ (a modality depending on a). A more readable notation [dc] 2 of is 


prefered. Take note that [¢|o] is not related to the Bayes inference (¢|c)! Now, the question arising is 
how to combine these modalities? For example, is it possible to deduce something from [¢;|o1] A [¢2|o2] ? 
Without any relations between heterogeneous modalities, it is not possible to answer this question. Such 
a relation, however, is foreseeable. Assume that the source tT involves the source 0, ie. T — 0. Now 
assume that the proposition ¢ should be known from the source a, ie. [¢|o]. Since 7 involves a, it is 
natural to state that ¢ should be known from the source 7, ie. [¢|T]. This natural deduction could be 
formalized by the rule: 


-r—+o implies F [dla] > [dlr] . 
With this rule, it is now possible to define the logic. 


The set of multi-modal propositions, mM(®) also denoted mM, is defined recursively: 
eCcmM, 
e Ifd,o€mM, then [d|o] EemM, 
elfd,0emM,then -PEmMM, d6AVEMM, 6VVEmMM andd—wvemM. 

The multi-modal logic obeys to the following rules and axioms: 

Modus Ponens. 

Classical axioms. Axioms[I]tof[/], 

Modal axioms and rule. Let o,7,¢,W €mM. 


m.i. From ¢ is deduced + [d|o]: axioms are sure, according to any sources, 


m.ii. F [6 > Io] > ([dlo] — [wI\o]) . If a source of information asserts a proposition and recognizes 


a weaker proposition, then it asserts this weaker proposition , 


m.iii.  [dlo] > &. The sources of information always tell the truth. If a source asserts a proposition, 


this proposition is actually true, 


m.iv.} 7—o implies + [¢|o] > [¢|7]. Knowledge increases with stronger sources of informa- 


tion. 


The axiom m.iii. is questionable and may be changed. But the work presented in this paper is restricted 


to this axiom. 
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It is also possible to consider some exotic rules like ¢ = [¢|L], ie. a perfect source of information L 
yields a perfect knowledge of the propositions ¢. Similarly, the modality [¢|T] could be interpreted as 
the proposition “¢ is an absolute truth” or “d has a proof”: one does not need any source of information 


to assert an absolute truth... 


8.6.3. Some multi-modal theorems 
8.6.3.1 Modal theorems map into multi-modal logic 


Let 4p € M be a modal proposition. Let ¢ € mM be a multi-modal proposition. Let p[o] € mM be the 


multi-modal proposition obtained by replacing 0 by [-|o] and o by —|-- |o] in the proposition 4. Then 


F yw implies plo]. 
8.6.3.2 Useful multi-modal theorems 
If the source o asserts @ and the source T asserts w, then the fused sources assert 6A W : 


F ([dlo] A blz] ) > [6A dle Az] 


Proof. From the axioms + (a Ar) > a andF (0 AT) > 7, it is deduced: 


F [lo] > [glo Az], 


and 


F [lr] > [plo Az]. 


From the useful theorems proved for modal logic, it is deduced: 
blo At] A Wlo Ar] = [6A lo Az]. 


The proof is concluded by applying the classical rules. 


If one of the sources o or T asserts }, then the fused sources assert @ : 


F ([dle] v [¢l7]) > [glo A7]. 


Proof. This results directly from + [¢|o] — [dla Az] and + [¢|T] > [dla Az]. 


The converse is not necessarily true: 


¥ [dla A7] > (Iele] v [9I7]) - 
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In fact, when sensors are not independent and possibly interactive, it is possible that the fused sensor 0 \T 
works better than o and T separately! On the other hand, this converse property could be considered as 
a necessary condition for the sensor independence. This discussion leads to the introduction of a new 


axiom, the independence axiom m.indep. : 


m.indep. - [dla A 7] > ((¢|o] V [¢|7] ) : 


8.6.4 Sensor fusion 
8.6.4.1 The context 


Two sensors, o and 7, are generating informations about a set of atomic propositions ®. More precisely, 
the sensors will measure independently the probability for each proposition ¢ €< ® > to be sure. In this 


section, it is discussed about fusing these sources of information. 


This problem is clearly embedded in the multi-modal logic formalism. In particular, the modality [-|c] 
characterizes the knowledge of o about the universe < ® >. More precisely, the proposition [¢|c] explains 
if @ is sure according to o or not. This knowledge is probabilistic: the working data are the probabilities 


p({dla]) and p([d|r]) for ¢ €< ® >. The problem setting is more formalized in the next section. 


Notation. From now on, the notation p[¢|o] is used instead of p([d|c]). Beware that p[d|o] is not 


the conditional probability p(¢|c) ! 


8.6.4.2 Sensors model and problem setting 


The set of multi-modal propositions, mM (0), is constructed from the set 0 = ®U{a, rT}. The propositions 
o and 7 are referring to the two independent sensors. The proposition o A 7 is referring to the fused 
sensor. It is assumed for sure that V4. ¢ is true: 


aNE 


oEe® 


Consequently: 


+ 


Vs a1 


oe® 


ona [Ve 


pEe® 


ly 


oe® 


1-Iy 


oEe® 


*|=1. 


The sensors o and 7 are giving probabilistic informations about the certainty of the other propositions. 


and: 


“| V9 


pEe® 


oho] =» V d 


oEe® 


°| =»| Vo 


pEe® 


More precisely, it is known: 
pidlo] and pld|7], forany dc<@>. 


Since the propositions o and 7 are referring to independent sensors, it is assumed that: 
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e The axiom m.indep. holds for o and 7, 


e For any ¢,U €< ® >, p([dlo] A [W|7]) = p[dlo]p[w|r] . 


A pertinent fused information is expected: 


How to compute p[dlo At] for any 6E< ®> ? 


8.6.4.3 Constructing the fused belief 


Defining tools. The useful propositions ¢() are defined for any ¢ €< ® >: 


[slo] A -( VV we] : 


WpE<O>:V~<o 


I> 


pi?) 


The same propositions 7) are defined for 7: 


er -( VV vie). 


WpE<O>:~<o 


I> 


gp” 


Properties. 


The propositions ¢™ are exclusive: 
AO AYO =L, forany d#v. 


Proof. Since [dla] A [~|o] = [6 A Yo], it is deduced: 


PO APO = aslo -( V a) nl V a) 


nn<d mN<wH 


It follows: 
0 AYO = [PA Yo] A ( \ “] A ( \ “ie 
mn<p mN<yp 
Since AW < dor d6AY < ww when ¢ Zw, the property is deduced. 


Lemma: VV [lo] < [do] . 
wp<¢d 


Proof. The property ~ < ¢ implies successively F w > 6, + [WwW — glo] and + [wlo] > [dla]. The lemma 


is then deduced. 


The propositions ¢”) are exhaustive: 


VV w = [dlo], and in particular: VV a ae 
wpse gE<o> 
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Proof. The proof is recursive. First, the smallest element of < ® > is p= A occas % and verifies: 


po) = [plo]. 
Secondly: 
VV yp = 9 v ( VV #0) aay ( VV My ) = 9) v ( VV ve] ; 
wpse wp<¢g wp<o mnsy pip<¢ 


Since $() = [dla] A =(Veev<ollel) and Vy.ycallo] < [dle] , it follows: 


VV wp = [dle]. 
pps 
The second part of the property results from: 


VV po?) VV wand Vs 


~E<O> WWSV g.¢€0 ? pEe® 


=T% 


oO 


The propositions @) are also exclusive and exhaustive: 
gM APM =1, forany oF#¥, 


and: 


VV ge = Ts 


pE<d> 


It is then deduced that the propositions 6°) Aw are exclusive and en | 


Vor, U1, G2, We E< ® > 4 (1, 1) x (2, W2) => (oh? A pi) A Om \ ps”) eal ) (8.9) 


and: 


Vo Gay) =. (8.10) 


b,pE<®P> 


From the properties (8.9) and (8.10) , it results that the set: 
B= {9 rw) /dpe<a>} 


is a partition of T. This property is particularly interesting, since it makes possible the computation of 
the probability of any proposition factorized within ©: 
vace.ol Vo] = 5° r(d). (8.11) 
ged ed 


?The notation (¢1,%1) = (¢2,%2) means ¢1 = ¢2 and yy = yo. 
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Factorizing $77). It has been shown that + ([@|o] A [W|7]) = [6A vlo AT]. It follows: 


F VE (dle A felt) = le Az, (8.12) 


PAYSN 


The axiom m.indep. says + [nla A 7] > ({nlo] Vv [n|7]). Since [Veco 4|°] = [Veco 4|7] =T, it is 


deduced: 
+ [nlo Ar] = ( (nes [Vacedlr}) v ([Veeeslo] 4 in) ) 
At last 
InloAtT]= VY ([dle] A fir)) - (8.13) 
PAUSN 


It is then deduced: 


sor stele ari aa( Ver =( ys ([nlo] A [¢|7] Na (Vv V (Inlo] A [C7] )). 


p< nNOS Ww<h NAC 
Now: 
Voacee(lnlo] A [Clr]) = Varecel (Vecrt) A (Wa “)) 
x Vanncse Ve<n Vie (Ee 1A x ye Vance” () A Co) . 
At last: 


gor) = (Vancsl" \ 2) et @ Vancey ln? A “)) 


Since & is a partition, it is deduced the final results: 


pon) = VV ni?) A ch (8.14) 
n\G=o 
and: 
p(o) se S- p(n AC) . (8.15) 
n\C=o 


This last result is sufficient to derive p[d|o A 7], as soon as we are able to compute the probability over 


X. The next paragraph explains this computation. 
The probability over ©. 


Computing p(¢°)) and p(¢”) . These probabilities are computed recursively from p[d|o] and 


p|o|r]. More precisely, it is derived from the definition 6%) = [d|o] A -(Vy Z slvlel) and the property 


[glo] = Vue wo) that: 


= [dlo] nl VV 0) i 


v<o 
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Since the propositions ¢() are exclusive and Vees WO < [elo], it follows: 


P($) = pidlo]— S- pv) . (8.16) 
w<ob 


This equation, related to the Moebius transform, is sufficient for computing p(o) recursively. 


Deriving p( gle”) . First, it is shown recursively that: 
P(E AG) = (6) (YU) . nt 


Proof - step 1. For the smallest element pp = | gests , it happens that yu?) = [u|o] and w™ = [p|z]. 
Since [y|o] and [u|T] are independent propositions, it follows then p(w) A pi7)) = p(w) p(w) ‘ 
Proof - step 2. Being given ¢,~ €< ® >, assume p(n A cn) a P(n (7))p (¢)) for any 7,¢ €< ®> 


such that (7 < @ and ¢ < y) or (7 < dand ¢ < y). From [d|o] = V,<40 (7) and [|r] = Vecy C(7) 
it is deduced: 


(dle) A [417] = (Vv i) A (V ) =V V (vac). 


n<¢ CS nse C<w 
It follows: 
P([dlo] A WIt]) = 32 So p(n Ag 
nod C<w 

pldlo] = © p(n”) 
nS 

pllr] = S- o(6™) 
CSP 


Now, [¢|c] and [:)|r] are independent and p([d|o] A [¥|7]) = p[dlo]p[b|r] . Then: 


S- Ss p(n A) =) Ss r( (n°) p(o™) . 


nso C<w nso C<w 


From the recursion assumption, it is deduced p(¢ Av™) = p(o)p(y™) . 


From the factorization , it is deduced: 


pg) = S$ p(n) (6) (8.18) 


n\G=o 


This result relies strongly on the independence hypothesis about the sensors. 


Back to [d|o \ 7]. Reminding that [d|o \7] = Vycy 7), the fused probability p[d|o A 7] is 


deduced from p(perr”) by means of the relation: 


pidlo At] = > owe) . (8.19) 
oxo 
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Conclusion. It is possible to derive an exact fused information p[dlo AT] , ¢@ €< ® > from the 
informations p[d|o] , 6 €< ® > and p[d|7] , é €< ® > obtained from two independent sensors o and T. 


This derivation is done in 3 steps: 
¢ Compute p(¢'”)) and p(¢”) by means of (SIG) , 
e Compute p(¢'°")) by means of (18), 


e Compute p[¢|o A 7] by means of (8.19) . 


8.6.4.4 Link with the DSmT 


It is noteworthy that the relation (8-18) looks strangely like the DSmT fusion rule (&4) , although these 
two results have been obtained from quite different viewpoints. In fact the similarity is not just related 
to the fusion rule and the whole construction is identical. More precisely, let us now consider the problem 


from the DSmT viewpoint. 


Let be defined for two sensors o and 7 the respective bba m, and m, over < ® >. The belief function 


associated to these two bba, denoted respectively Bel, and Bel, , are just verifying: 


Belg(¢) = 5) mo(w) and Bel,(4) = $° m-(#). 
pre pre 


Conversely, the bba m, is recovered by means of the recursion: 


Vo E< ® >, mo(¢) = Bel (¢) — > mo(#) - 
p< 


The fused bba m, © m, is defined by: 


Me @m-(¢)= S> mo(v)m,(n). 
pAn=ed 


Now make the hypothesis that the probabilities p[¢|c] and p[¢|r] are initialized for any ¢ €< ® > by: 


p[dlo] = Bel,(¢) and _ p[¢|7] = Bel, (¢) . 
Then, the following results are obviously obtained: 
© p(d) = me(d) and p(s) = me(d), 
© p(go7)) =m, &m-(4), 
e pldlo A Tr] = Bel, ® Bel,(¢) , where Bel, © Bel, is the belief function associated to mz ®m,. 


From this discussion, it seems natural to consider the probabilized multi-modal logic mM as a possible 


logical interpretation of DSmT. 
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Evaluate the consequence of the independence axiom. By using the axiom m.indep. , it is possible 
to prove (813). Otherwise, it is only possible to prove (812), which means that possibly more belief 
is put on the smallest propositions, in comparison with the independent sensors case. Such a property 
expresses a better and more precise knowledge about the world. Then it appears, accordingly to the 
mM interpretation of DSmT, that the fusion rule © is an optimal rule only for fusing independent and 


(strictly) reliable sensors. 


8.7 Logical interpretation of the Bayes inference 


Notation. In the sequel, ¢ < w is just an equivalent notation for (w — 6) \(¢— v). 


General discussion. The Bayes inference explains the probability of a proposition w, while is known 


a proposition ¢. This probability is expressed as follows by the quantity p(w|@) : 


POA) = p()PH 4) - 


From this implicit and probabilistic definition, (¢)|¢) appears more like a mathematical artifice than an 
actual “logical” operator. However, (y|¢) has clearly a meta-logical meaning although it is intuitive and 
just implied: it characterizes the knowledge about w, when a prior information ¢ is known. In this 
section, we are trying to interpret the Bayes operator ( | ) as a logical operator. The author admits 
that this viewpoint seems extremely suspicious: the Bayes inference implies a change of the probabilistic 
universe, and then a change of the truth values! It makes no sense to put at the same level a conditional 
probability with an unconditional probability! But in fact, there are logics which handle multiple truths: 
the modal logics, and more precisely, the multi-modal logics. However, the model we are defining here is 
quite different from the usual modal models. 

From now on, we are assuming a same logic involving the whole operators, te. A, =, V, > and (|), and 


a same probability function p defined over the resulting propositions. 


When defining a logic, a first step is perhaps to enumerate the intuitive properties the new logic should 
have, and then derive new language and rules. Since a probability is based on a Boolean algebra, this 
logic will include the classical logic. A first question arises then: is the Bayes inference (| ) the same 
inference than in classical logic? More precisely, do we have (w|¢) = ¢ — w? If our logical model is 


coherent with the probability, this should imply: 


PCIe) = P(e ¥) = ve Vy) . 


Applying the Bayes rule, it is deduced: 


P(PAY) = V(G)P(Ae V H) = (W(GA ) + (OA 7) — p(@A 7H))- 
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results in } = °.! The Bayes inference 


This is clearly false: eg. taking p(¢ A aw) = E and p(d@Aw~) =} >= 16! 


2 
(w|d) is not a classical inference. Since it is a new kind of inference, we have to explain the meaning of 


this inference. 


The Bayes inference seems to rely on the following principles: 


e Any proposition ¢ induces a sub-universe, entirely characterized by the Bayes operator (-|¢). For 
this reason, (-|¢) could be seen as a conditional modality. But this modality possesses a strange 
quality: the implied sub-universe is essentially classical. From now on, (-|¢) refers both to the 


modality and its induced sub-universe, 


e The sub-universe (-|T) is just the whole universe. The empty universe (-|) is a singularity which 


cannot be manipulated, 


e The sub-universe (-|¢) is a projection of the sup-universe (which could be another sub-universe) 
into ¢. In particular, the axioms of (-|¢) result from the propositions which are axioms within the 


range @ in the sup-universe. Moreover, the modus ponens should work in the sub-universes, 


e Any sub-proposition (w|¢) implies the infered proposition ¢ — w in the sup-universe. This last 
point in not exactly the converse of the previous point. The previous point concerns axioms, while 
any possible propositions are considered here. This (modal-like) difference is necessary and makes 


the distinction between (|) and —, 
e Since sub-universes are classical, the negation has a classical behavior: the double negation vanishes, 


e The sub-universe of a sub-universe is the intersected sub-universe. For example, “considering blue 


animals within a universe of birds” means “considering blue birds”. 


In association with the Bayes inference is the notion of independence between propositions, described by 
the meta-operator x , which is not an operator of the logic. More precisely, ~ is independent to @, ie. 
wx @, when it is equivalent to consider w within the sub-universe ¢ or within the sup-universe. Deciding 
whether this meta-operator is symmetric or not is probably another philosophical issue. In the sequel, 
this hypothesis is made possible in the axiomatization but is not required. Moreover, it seems reasonable 
that complementary propositions like @ and =@ cannot be independent unless ¢ = T. In the following 


discussion, such a rule is proposed but not required. 


8.7.1 Definitions 
8.7.1.1 Bayesian modal language 
The set of the Bayesian propositions bM is constructed recursively: 


e CcbM, 
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e If d,~ € bM, then (w|d) € bM, 


e If d,yebM, then -PGEdM, d6AWEbM, dVYWEbM andd—wPeEebM. 


8.7.1.2 Bayesian logical rules 

The logic over bM obeys the following rules and axioms: 

e Classical axioms and modus ponens , 

b.i. (@|T) = ¢; the sub-universe of T is of course the whole universe, 


b.ii. It is assumed ¥ -d. Then, + 6 > w implies + (w|¢); axioms within the range ¢ are axioms of the 


sub-universe (-|d) , 


b.iii. It is assumed K =¢. Then, F (w > n\¢) > ((w\¢) > (nld)) ; when both an inference and a 
premise are recognized true in a sub-universe, the conclusion also holds true in this sub-universe. 


This property allows the modus ponens within sub-universes, 


b.iv. It is assumed ¥ =. Then, + (w/b) — (¢ > w); the modality (-|b) implies the truth within the 


range ¢, 


b.v. It is assumed K -=¢. Then, 7(77|¢) = (W|¢); there is no doubt within the modality (-|¢). Sub- 
universes have a classical negation operator. However, truth may change depending on the propo- 


sition of reference @, 


b.vi. It is assumed ¥ 7=(¢ A) 2 Then, ((n|w)|¢) = (n|w A @); the sub-universe (-|W) of a sub-universe 


(-|¢) is the intersected sub-universe (-|b Aw) , 


b.vii. & x @ means | (W|¢) © Ww; a proposition w is independent to a proposition b when it makes no 


difference to observe it in the sub-universe (-|¢) or not, 
b.viii. (optional) ~ x ¢ implies ¢ x wW; the independence relation is symmetric, 


b.ix. (optional) Assuming ¢ x w and+ éV wv implies F ¢ or Fw; this uncommon logical rule explains 
that complementary and non trivial propositions cannot be independent. EG. to an extreme degree, 
@ and 7@ are strictly complementary and at the same time are not independent unless 6 = T or 


ga. 


These axioms leave the modality (-|1) undefined, by requiring the condition ¥ =¢ for any deduction on 
the sub-universe (-|¢). In fact, the modality (-|_) is a singularity which cannot be defined according to 
the common axioms and rules. Otherwise, it would be deduced from + L — ¢ that + (|); this last 


deduction working for any ¢ would contradict the negation rule -=(=¢|L) = (¢|L). Nevertheless, the 


axioms b.vii. and b.viii. induces a definition of x for any pair of propositions, except (L, L). 


3It will be proved that the hypothesis K —(¢ A ~) implies the hypotheses K = and ¥ (-7)|¢@) . 
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8.7.2 Properties 
8.7.2.1 Probability over bM 


A probability p over bM is defined according to the definition of section [8-4]. In particular, since the 


meta-operator x characterizes an independence between propositions, it is naturally hypothesized that: 


exw implies p(¢Av) =p(d)p(y) - 


8.7.2.2 Useful theorems 
Sub-universes are classical. It is assumed 4 4d. Then: 
e (=¥/6) = -(Y19), 
© (bAn|d) = (YI¢) A (nl¢) , 
© (WV ld) = Ie) V (mle), 
e (Y > mld) = (Id) > (ald). 


Proof. The first theorem is a consequence of axiom b.v. 


From axiom b.iii., it is deduced + (>w V n|¢) > (>(v|¢) V (n|¢)) . Applying the first theorem, it 
is deduced + (=y V n|¢) > ((>¥|¢) V (m]@)) . At last: 
F (WV mld) > (14) Vv (alg) - (8.20) 
It is deduced F =((#|@) V (n|¢)) > > V n|¢) and, by applying the first theorem, 
F ((-¥19) A (mle) > (“A 16) - 
At last: 
F ((4]¢) A (nlé)) + @ Anld) . 
Now, it is deduced from + ¢ > ((b An) 7) that: 
F (An) > I¢) - 
By applying the axiom b.iii.: 
F (And) > (Ye) - 
It is similarly proved that + (¢~ A n|¢) > (n|@) and finally: 
 (b A mld) > (Id) A (nld)) - 
The second theorem is then proved. 
Third theorem is a consequence of the first and second theorem. 


Last theorem is a consequence of the first and third theorem. 
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Inference property. It is assumed ¥ =¢é. Then (w|¢) A ¢ = Aw. In particular, the hypothesis 
¥ ~(@A wy) implies the hypotheses K 7@ and ¥ (=w|¢) . 


Proof. From b.iv. it comes (¢#|¢) = (¢ — w). Then 7(¢ — W) — 7(|¢) and F (¢Anw) — (79) . 
It follows F (6 A w) — (|) and finally: 


t (PA) > (HIP) A) - 
The converse is more simple. From | (w|¢) > (¢ > w), it follows: 
F (1d) A) > ((6 > H) N49). 


Since (6 — JW) AG = oA, the converse is proved. 


Intra-independence. It is assumed ¥ =@. Then (|) x (v|¢) is equivalently defined by the property 
F (nib) — n]¢9) . 


Proof. 
(nl) — n|¢) = ((nld)|¢) © (mld) = (mld A ¥) © (I) 


= (nl¢A (Hle)) > (ld) = ((al@)|(I4)) © (nid) - 


Independence invariant. ~ x ¢ implies =w x ¢. 


Proof. 


(1¢) o WH = WY10) oe “WH = Wd) Oo”. 


Inter-independence. It is assumed ¥ =¢. Then (w|d) x ¢. 


Proof. From axiom b.vi.: 


(W1e)|d) = WIOA 4) = WI¢) . 
It is deduced (wW|¢) x @. 


Corollary: assuming the rules b.viii. and b.ix., the hypotheses K = and ¥ (=w|¢) imply the hypothesis 

KA(bAY). 

Proof. Assume - =>(6 A w). Then + a(¢ A (|¢)) and + 7=¢ V 7=(~|¢é). Since (=v|d) x ¢, it follows 
x (-y|¢) from rule b.viii. And then =¢ x 7(w|¢) . Now, applying the rule b.ix. tok ~@V (|) , 
it is deduced + a¢ or F (wy ¢). 
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A proposition is true in its proper sub-universe. It is assumed K -¢. Then t (¢|¢). 


Proof. Obvious from + ¢ > @¢. 


Narcissist independence. It is assumed ¥ =¢. Then, ¢x ¢ implies + ¢ and conversely. In particular, 


ox dimplies G=T. 


Proof. 
(016) -d¢=TOg¢=e. 


Non transitivity (modus barbara fails). It is assumed ¥ 7=¢ and K =. Then 


¥ (wld) > ((mlv) > (al¢)) - 


Proof. The choice » = T, 7 =7¢ and ¢#T is a counter example: 


(Td) > ((-9T) = (419) = T > (“6 > L) =. 


8.7.2.3. Axioms and rules extend to sub-universes 


Assume ¥ ad. The rules and axioms of bM extend on the sub-universe (-|¢) : 
e | w implies F (¢|¢) , 
e It is assumed ¥ =(¢ A). Then + ( — 7¢) implies + ((n\v)|¢) , 


e It is assumed K =(6A W). Then ((n = ¢\w)|¢) => (( (nl) — (Cl) )\o), 


e It is assumed K =(¢A Ww). Then + ((nlb)|4) = (¥ > 7). 


Proof. + q implies + ¢ > w and then + (x)|¢). First point is then proved. 
It is successively implied from + ( — ml¢): 

F (Wd) > (nl) , 

F ((nl¢)|(WI4)) , 

F (nloA (I¢)) , 

r ( 

F 


ONY), 
(n|b)|¢) . 


Ui 
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Second point is then proved. 
By applying axiom b.iii. and first point, it comes: 
F ((n Clb) > (nh) + IY) ¢) - 


It follows: 
F ((n > ¢d)|4) > (lv) > (Cl) 4) - 
Third point is proved. 


By applying axiom b.iv. and first point, it comes: 


F ((nlb) > (b > n)|¢) 


It follows: 
F ((nlb)|¢) > (b > al¢). 


Fourth point is proved. 


8.7.2.4 Bayes inference 


It is assumed ¥ =¢. Define p(w|¢) as an abbreviation for P((w|¢)) . Then: 


P(L1)P(d) = D(PAY) - 


Proof. This result is implied by the theorems (~|¢) \¢d = ¢A wv and (w|¢) x ¢. 


8.7.2.5 Conclusion 


Finally, the Bayes inference has been recovered from our axiomatization of the operator (-|-). Although 
this result needs more investigation, in particular for the justification of the coherence of bM , it appears 
that the Bayesian inference could be interpreted logically as a manner to handle the knowledges. A 
similar result has been obtained for the fusion rule of DSmT. At last, it seems possible to conjecture that 


logics and probability could be mixed in order to derive many other belief rules or inferences. 


8.8 Conclusion 


In this contribution, it has been shown that DSmT was interpretable in the paradigm of probabilized 
multi-modal logic. This logical characterization has made apparent the true necessity of an independence 


hypothesis about the sensors, when applying the & fusion rule. Moreover, it is expected that our work 
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has given some clarifications about the semantic associated with the conjunctive rule of DSmT. 

A similar logical interpretation of the Bayes inference has been constructed, although this preliminary 
work should be improved. At last, it seems possible to handle probabilized logics as a relatively general 
framework for manipulating non deterministic informations. This is perhaps a generic method for con- 
structing new customized belief theories. The principle is first to construct a logic well adapted to the 
problem, second to probabilize this logic, and third to derive the implied new belief theory (and forget 


then the mother logic!) : 


Probabilized 
Classical Logic Dapper eo Probability 
a 
New Logic _ New Belief Theory 
propositions 


It seems obviously that there could be many theories and rules for manipulating non deterministic infor- 
mations. This is not a new result and I feel necessary to refer to the works of Sombo, Lefevre, De Brucq 


and al. {6} [4] [7] , which have investigated such questions. 


At last, a common framework for both DSmT and Bayesian inference could be certainly derived by fusing 


the logics mM and bM. 
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